# Edge device optimization

Phi 2 GGUF
MIT
phi-2 is a text generation model employing IQ-DynamicGate ultra-low bit quantization (1-2 bits), suitable for natural language processing and code generation tasks.
Large Language Model Supports Multiple Languages
Mungert
472
2
Orpheus 3b 0.1 Ft GGUF
Apache-2.0
An ultra-low-bit quantized model based on the Llama-3-8B architecture, using IQ-DynamicGate technology for adaptive 1-2 bit precision quantization, suitable for memory-constrained environments.
Large Language Model English
Mungert
1,427
1
Olympiccoder 32B GGUF
Apache-2.0
OlympicCoder-32B is a code generation model based on Qwen2.5-Coder-32B-Instruct, employing IQ-DynamicGate ultra-low-bit quantization technology for efficient inference in memory-constrained environments.
Large Language Model English
Mungert
361
3
EXAONE Deep 32B GGUF
Other
EXAONE-Deep-32B is a 32B-parameter large language model supporting English and Korean, specifically designed for text generation tasks.
Large Language Model Supports Multiple Languages
Mungert
2,249
3
Llama 3.1 Nemotron Nano 8B V1 GGUF
Other
An 8B-parameter model based on the Llama-3 architecture, optimized for memory usage with IQ-DynamicGate ultra-low-bit quantization technology.
Large Language Model English
Mungert
2,088
4
EXAONE Deep 7.8B GGUF
Other
A 7.8B-parameter model featuring ultra-low-bit quantization (1-2 bits) using IQ-DynamicGate technology, supporting English and Korean text generation tasks.
Large Language Model Supports Multiple Languages
Mungert
1,791
5
Mistral Small 3.1 24B Instruct 2503 GGUF
Apache-2.0
This is an instruction-tuned model based on Mistral-Small-3.1-24B-Base-2503, utilizing GGUF format and IQ-DynamicGate ultra-low bit quantization technology.
Large Language Model Supports Multiple Languages
Mungert
10.01k
7
Qwen2.5 7B Instruct 1M GGUF
Apache-2.0
Qwen2.5-7B-Instruct-1M is an instruction-tuned version based on Qwen2.5-7B, employing IQ-DynamicGate ultra-low-bit quantization (1-2 bits), suitable for efficient inference in memory-constrained environments.
Large Language Model English
Mungert
1,342
4
Llama 3.1 8B Instruct GGUF
Llama-3.1-8B-Instruct is an instruction-tuned version of Llama-3.1-8B, quantized with IQ-DynamicGate ultra-low-bit quantization (1-2 bits) to improve accuracy while maintaining memory efficiency.
Large Language Model Supports Multiple Languages
Mungert
1,073
3
Mistral 7B Instruct V0.2 GGUF
Apache-2.0
Mistral-7B-Instruct-v0.2 is an instruction-tuned text generation model based on the Mistral-7B architecture, optimized for memory efficiency using IQ-DynamicGate ultra-low-bit quantization technology.
Large Language Model
Mungert
742
2
Reasonablellama3 3B Jr
A fine-tuned model based on LLaMA-3B, enhanced with reasoning capabilities and multilingual support.
Large Language Model Supports Multiple Languages
adeelahmad
1,173
6
Tiny Agent A 3B
Other
Tiny Agent-α is a lightweight AI agent trained on the Qwen2.5-Coder model series, designed for edge devices and supporting Pythonic function calling.
Large Language Model Supports Multiple Languages
driaforall
207
13
Comment Moderation
Openrail
A lightweight, high-accuracy multi-label content moderation system built on the DistilBERT architecture for detecting and classifying potentially harmful content in user comments.
Text Classification Transformers English
Vrandan
45.47k
1
Tiny Hinglish Chat 21M
MIT
A tiny text-completion model for mixed Hindi-English (Hinglish) dialogue, capable of conversing on everyday topics.
Dialogue System Transformers English
Abhishekcr448
178
4
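
Several entries above cite 1-2 bit quantization as a fit for memory-constrained environments. As a rough illustration of why bit width matters, the sketch below estimates weight storage as parameters × bits per weight ÷ 8. The function name `weight_memory_gib` is hypothetical, the bit widths are illustrative, and the estimate covers weights only; real GGUF files mix precisions across layers and add metadata, and inference also needs memory for the KV cache and activations, so actual usage will be higher.

```python
def weight_memory_gib(n_params: float, bits_per_weight: float) -> float:
    """Approximate storage for model weights alone, in GiB.

    Hypothetical helper for illustration; ignores file metadata,
    mixed-precision layers, KV cache, and activations.
    """
    total_bytes = n_params * bits_per_weight / 8
    return total_bytes / 2**30


if __name__ == "__main__":
    # Parameter counts come from the model names in the listing above.
    models = [("EXAONE Deep 7.8B", 7.8e9), ("OlympicCoder-32B", 32e9)]
    for name, n_params in models:
        for bits in (16, 4, 2):  # illustrative bit widths
            gib = weight_memory_gib(n_params, bits)
            print(f"{name} at {bits} bits/weight: about {gib:.1f} GiB")
```

Under these assumptions, halving the bits per weight halves the weight footprint, which is the basic reason ultra-low-bit variants are attractive on edge devices.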
© 2025 AIbase